NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

NeuralFeels with neural fields: Visuotactile perception for in-hand manipulation

https://doi.org/10.1126/scirobotics.adl0628

Suresh, Sudharshan; Qi, Haozhi; Wu, Tingfan; Fan, Taosha; Pineda, Luis; Lambeta, Mike; Malik, Jitendra; Kalakrishnan, Mrinal; Calandra, Roberto; Kaess, Michael; et al (November 2024, Science Robotics)
Yashinski, Melisa (Ed.)
To achieve human-level dexterity, robots must infer spatial awareness from multimodal sensing to reason over contact interactions. During in-hand manipulation of novel objects, such spatial awareness involves estimating the object’s pose and shape. The status quo for in-hand perception primarily uses vision and is restricted to tracking a priori known objects. Moreover, visual occlusion of objects in hand is imminent during manipulation, preventing current systems from pushing beyond tasks without occlusion. We combined vision and touch sensing on a multifingered hand to estimate an object’s pose and shape during in-hand manipulation. Our method, NeuralFeels, encodes object geometry by learning a neural field online and jointly tracks it by optimizing a pose graph problem. We studied multimodal in-hand perception in simulation and the real world, interacting with different objects via a proprioception-driven policy. Our experiments showed final reconstructionFscores of 81% and average pose drifts of 4.7 millimeters, which was further reduced to 2.3 millimeters with known object models. In addition, we observed that, under heavy visual occlusion, we could achieve improvements in tracking up to 94% compared with vision-only methods. Our results demonstrate that touch, at the very least, refines and, at the very best, disambiguates visual estimates during in-hand manipulation. We release our evaluation dataset of 70 experiments, FeelSight, as a step toward benchmarking in this domain. Our neural representation driven by multimodal sensing can serve as a perception backbone toward advancing robot dexterity.
more » « less
Full Text Available
Reconstructing Hands in 3D with Transformers

Pavlakos, Georgios; Shan, Dandan; Radosavovic, Ilija; Kanazawa, Angjoo; Fouhey, David; Malik, Jitendra (June 2024, CVPR)

Full Text Available
GOAT: GO to Any Thing

https://doi.org/10.15607/RSS.2024.XX.073

Chang, Matthew; Gervet, Theophile; Khanna, Mukul; Yenamandra, Sriram; Shah, Dhruv; Min, So; Shah, Kavit; Paxton, Chris; Gupta, Saurabh; Batra, Dhruv; et al (July 2024, Robotics: Science and Systems Foundation)

In deployment scenarios such as homes and warehouses, mobile robots are expected to autonomously navigate for extended periods, seamlessly executing tasks articulated in terms that are intuitively understandable by human operators. We present GO To Any Thing (GOAT), a universal navigation system capable of tackling these requirements with three key features: a) Multimodal: it can tackle goals specified via category labels, target images, and language descriptions, b) Lifelong: it benefits from its past experience in the same environment, and c) Platform Agnostic: it can be quickly deployed on robots with different embodiments. GOAT is made possible through a modular system design and a continually augmented instance-aware semantic memory that keeps track of the appearance of objects from different viewpoints in addition to category-level semantics. This enables GOAT to distinguish between different instances of the same category to enable navigation to targets specified by images and language descriptions. In experimental comparisons spanning over 90 hours in 9 different homes consisting of 675 goals selected across 200+ different object instances, we find GOAT achieves an overall success rate of 83%, surpassing previous methods and ablations by 32% (absolute improvement). GOAT improves with experience in the environment, from a 60% success rate at the first goal to a 90% success after exploration. In addition, we demonstrate that GOAT can readily be applied to downstream tasks such as pick and place and social navigation.
more » « less
Full Text Available
Adapting Rapid Motor Adaptation for Bipedal Robots

https://doi.org/10.1109/IROS47612.2022.9981091

Kumar, Ashish; Li, Zhongyu; Zeng, Jun; Pathak, Deepak; Sreenath, Koushil; Malik, Jitendra (October 2022, Proceedings of the IEEERSJ International Conference on Intelligent Robots and Systems)

Full Text Available
Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging

Angelopoulos, Anastasios N; Kohli, Amit P; Bates, Stephen; Jordan, Michael I; Malik, Jitendra; Alshaabi, Thayer; Upadhyayula, Srigokul; Romano, Yaniv (January 2022, International Conference on Machine Learning)

Full Text Available

Search for: All records